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Abstract — MIMO technology is one of the most significant 
advances in the past decade to increase channel capacity and has 
a great potential to improve network capacity for mesh networks. 
In a MIMO-based mesh network, the links outgoing from each 
node sharing the common communication spectrum can be 
modeled as a Gaussian vector broadcast channel. Recently, re- 
searchers showed that "dirty paper coding" (DPC) is the optimal 
transmission strategy for Gaussian vector broadcast channels. So 
far, there has been little study on how this fundamental result will 
impact the cross-layer design for MIMO-based mesh networks. 
To fill this gap, we consider the problem of jointly optimizing 
DPC power allocation in the link layer at each node and 
multihop/multipath routing in a MIMO-based mesh networks. 
It turns out that this optimization problem is a very challenging 
non-convex problem. To address this difficulty, we transform 
the original problem to an equivalent problem by exploiting 
the channel duality. For the transformed problem, we develop 
an efficient solution procedure that integrates Lagrangian dual 
decomposition method, conjugate gradient projection method 
based on matrix differential calculus, cutting-plane method, and 
subgradient method. In our numerical example, it is shown that 
we can achieve a network performance gain of 34.4% by using 
DPC. 

I. Introduction 

Since Telatar's [1] and Foschini's [2] pioneering works 
predicting the potential of high spectral efficiency provided 
by multiple antenna systems, the last decade has witnessed 
a soar of research activity on Multiple-Input Multiple-Output 
(MIMO) technologies. The benefits of substantial improve- 
ments in wireless link capacity at no cost of additional 
spectrum and power have quickly positioned MIMO as one 
of the breakthrough technologies in modern wireless com- 
munications, rendering it as an enabling technology for next 
generation wireless networks. However, applying MIMO in 
wireless mesh networks (WMNs) is not a trivial technical 
extension. With the increased number of antennas at each 
node, interference is likely to become stronger if power level 
at each node, power allocation to each antenna element, and 
routing are not managed wisely. As a result, cross-layer design 
is necessary for MIMO-based WMNs. 

In a MIMO-based WMN, the set of outgoing links from 
a node sharing a common communication spectrum can be 
modeled as a nondegraded Gaussian vector broadcast channel, 
for which the capacity region is notoriously hard to analyze 
[3]. In the networking literature, most works considering 
links sharing a common communication spectrum are con- 
cerned with how to allocate frequency sub-bands/time-slots 



and schedule transmissions to efficiently share the common 
communication spectrum. As an example, Fig. 1(a) shows a 
simple broadcast channel where there are three uncoordinated 
users and a single transmitting node. Suppose that messages 
x, y, and z need to be delivered to user 1, user 2, and user 3, 
respectively. Also, suppose that the received signals subject to 
ambient noise are x, y, and z, and the decoding functions are 
/i( - )> /2(0' anc l /3(')> respectively. The conventional strategy 
is to divide a unit time frame into three time slots (or divide 
a unit band into three sub-bands) t\, T2, and T3, and then 
find the optimal scheduling for transmissions to users 1, 2 
and 3, accordingly. The major benefit of this strategy is that 
interference can be eliminated. 




User 3 User 1 




User 2 
/,(>') lnT 2 



User 2 

/ 2 (>+4) 



(a) Time or frequency division (b) DPC transmission strategy 

Fig. 1. A 3-user broadcast channel example. 

Although the time or frequency division schemes are simple 
and effective, they are not necessarily the smartest strategy. 
In fact, Cover had shown in his classical paper [4] that the 
transmission scheme jointly encoding all receivers' informa- 
tion at the transmitter can do strictly better in broadcast chan- 
nels. However, the capacity achieving transmission signaling 
scheme for general nondegraded Gaussian vector broadcast 
channels is very difficult to determine and has become one 
of the most basic questions in network information theory 
[3]. Very recently, significant progress has been made in 
this area. Most notably, Weigarten et. al. finally proved the 
long-open conjecture that the "dirty paper coding" strategy 
(DPC) [5] is the optimal transmission scheme for Gaussian 
vector broadcast channels [6] in the sense that the DPC rate 
region Cdpc of a broadcast channel is equal to the broadcast 
channel's capacity region Cbcs i- e -> Cbc = Cdpc- However, 
this fundamental result is still not adequately exposed to the 
networking research community. So far, how to exploit DPC's 



benefits in the cross-layer design for wireless mesh networks 
has not yet been studied in the literature. The main objective 
of this study is to fill this gap and to obtain a rigorous and 
systematic understanding of the impact of applying DPC to 
the cross-layer optimization for MIMO-based mesh networks. 

To begin with, it is beneficial to introduce the basic idea 
of DPC, which turns out to be very simple. For the same 
3 -user example, consider the following strategy as shown 
in Fig. 1(b). We first jointly encode the messages for all 
the users in a certain order and then broadcast the resulting 
codeword simultaneously. Suppose that we pick user 1 to be 
encoded first, then followed by user 2, and finally user 3. 
We choose the codeword x for user 1 as before. Then, the 
interference seen by user 2 due to user 1 (denoted by x 2 ) 
is known at the transmitter. So, the transmitter can subtract 
the interference and encode user 2 as y' = y — x 2 rather 
than y itself. As a result, user 2 does not see any interference 
from the signal intended for user 1. Likewise, after encoding 
user 2, the interferences seen by user 3 due to user 1 and 2 
(denoted by x 3 and y 3 ) are known at the transmitter. Then, the 
transmitter can subtract the interferences and encode user 3 as 
z' = z — x 3 — j/3 rather than z itself. Therefore, user 3 does not 
see any interferences from the signals intended for user 1 and 
2. In the end, the transmitter adds all the codewords together 
and broadcasts the sum to all users simultaneously. As a result, 
it is easy to see from Fig. 1(b) that the received signal at user 
1 is x + iji + zi, i.e., user 1 will experience the interference 
from the signals intended for users 2 and 3; the received 
signal at user 2 is y + z 2 , i.e., user 2 only experiences the 
interference from the signal intended for user 3; and finally, the 
received signal at user 3 is z, i.e., user 3 does not experience 
any interference. This process operates like writing on a dirty 
paper, hence the name. Although counterintuitive, the capacity 
region of DPC that allows interference is strictly larger than 
those of time or frequency division schemes. 

After understanding what DPC is, one may ask two very 
natural and interesting questions: 

1) How will the enlarged capacity region at each node due 
to DPC impact the network performance in the upper 
layers? 

2) Are there any new challenges if DPC is employed in a 
MIMO-based networking environment? 

Notice that, when DPC is employed, the encoding order 
plays a critical role. For a if -user broadcast channel, there 
exists K\ permutations. Also, since DPC allows interference 
among the users, power allocation among different users 
along with the encoding order has a significant impact on 
the system performance. As we show later, the DPC link 
rates in a broadcast channel are non-connvex functions. Thus, 
even the optimization for a single if -user Gaussian vector 
broadcast channel is a very challenging combinatorial non- 
convex problem, not to mention the cross-layer design in a 
networking environment with multiple broadcast channels. 

In this paper, we aim to solve the problem of jointly 
optimizing DPC per-antenna power allocation at each node 
in the link layer and multihop/multipath routing in a MIMO- 



based WMN. Our contributions are three-fold. First, this paper 
is the first work that studies the impacts of applying DPC 
to the cross-layer design for MIMO-based WMNs. In our 
numerical example, it is shown that we can achieve a network 
performance gain of 34.4% by using DPC in MIMO-based 
WMNs. Also, since the traditional single-antenna systems can 
be viewed as a special case of MIMO systems, the findings 
and results in this paper are also applicable to conventional 
WMNs with single-antenna. Second, to address the non- 
convex difficulty, we transform the original problem to an 
equivalent problem under the dual MIMO multiple access 
channel (MIMO-MAC) and show that the transformed prob- 
lem is convex with respect to the input covariance matrices. We 
simplify the maximum weighted sum rate problem for the dual 
MIMO-MAC such that enumerating different encoding order 
is unnecessary, thus paving the way to efficiently solve the 
link layer subproblem in Lagrangian dual decomposition. Last, 
for the transformed problem, we develop an efficient solu- 
tion procedure that integrates Lagrangian dual decomposition 
method, conjugate gradient projection method based on matrix 
differential calculus, cutting-plane method, and subgradient 
method. 

The remainder of this paper is organized as follows. In 
Section II, we discuss the network model and problem for- 
mulation. Section III discusses how to reformulate the non- 
connvex original problem by exploiting channel duality. In 
Section IV, we introduce the key components for solving 
the challenging link layer subproblem in the Lagrangian 
decomposition. Numerical results are provided in Section V 
to illustrate the efficacy of our proposed solution procedure 
and to study the network performance gain by using DPC. 
Section VI reviews related work and Section VII concludes 
this paper. 

II. Network Model 

We first introduce notations for matrices, vectors, and com- 
plex scalars in this paper. We use boldface to denote matrices 
and vectors. For a matrix A, A^ denotes the conjugate trans- 
pose, Tr{A} denotes the trace of A, and | A| denotes the deter- 
minant of A. Diag{Ai , . . . , A„} represents the block diagonal 
matrix with matrices Ai, . . . , A„ on its main diagonal. We let 
I denote the identity matrix with dimension determined from 
context. A >z represents that A is Hermitian and positive 
semidefinite (PSD). 1 and denote vectors whose elements 
are all ones and zeros, respectively, and their dimensions are 
determined from context. (v) TO represents the m th entry of 
vector v. For a real vector v and a real matrix A, v > 
and A > mean that all entries in v and A are nonnegative, 
respectively. We let be the unit column vector where the i 
entry is 1 and all other entries are 0. The dimension of is 
determined from context as well. The operator "(, )" represents 
the inner product operation for vectors or a matrices. 

A. Network Layer 

In this paper, the topology of a MIMO-based wireless 
mesh network is represented by a directed graph, denoted 



by Q = {A/ - , £}, where J\f and C are the set of nodes 
and all possible MIMO-based links, respectively. By saying 
"possible" we mean the distance between a pair of nodes is 
less than or equal to the maximum transmission range D max , 
i.e., C = ■ D i3 < D max , i,j e TV, i ^ j}, where 

Dij represents the distance between node i and node j. D max 
can be determined by a node's maximum transmission power. 
We assume that Q is always connected. Suppose that the 
cardinalities of the sets M and C are \J\f\ = N and \C\ = L, 
respectively. For convenience, we index the links numerically 
(e.g., link 1, 2, . . . , L) rather than using node pairs 

The network topology of Q can be represented by a node- 
arc incidence matrix (NAIM) [7] A e R NxL , whose entry 
a n i associating with node n and arc I is defined as 

{1 if n is the transmitting node of arc I 
— 1 if n is the receiving node of arc I (1) 
otherwise. 

We define O (n) and 1 (n) as the sets of links that are 
outgoing from and incoming to node n, respectively. We use 
a multicommodity flow model for the routing of data packets 
across the network. In this model, several nodes send different 
data to their corresponding destinations, possibly through 
multipath and multihop routing. We assume that the flow 
conservation law at each node is satisfied, i.e., the network 
is a flow-balanced system. 

Suppose that there are F sessions in total in the network, 
representing F different commodities. The source and des- 
tination nodes of session /, 1 < / < F, are denoted as 
src(/) and dst(/), respectively. For the supply and demand 
of each session, we define a source-sink vector s/ € M. N , 
whose entries, other than at the positions of src(/) and dst(/), 
are all zeros. In addition, from the flow conservation law, we 
musthave (s/) src (^) = — (s/)dst(/)- Without loss of generality, 
we let (s/) src (/) > and simply denote it as a scalar Sf. 
Therefore, we can further write the source-sink vector of flow 
/ as 

s f = s f [ ■■■ 1 ••• -1 ••• ] T , (2) 

where the dots represent zeros, and 1 and —1 are in the 
positions of src(/) and dst(/), respectively. Note that for the 
source-sink vector of a session /, 1 does not necessarily appear 
before —1 as in (2), which is only for an illustrative purpose. 
Using the notation "= x ,y" to represent the component-wise 
equality of a vector except at the x th and the y th entries, 
we have Sf = src (/),dst(/) 0- I n addition, using the matrix 

to denote the collection 



session /. At node n, components of the flow vector and 
source-sink vector for the same commodity satisfy the flow 



conservation law as follows: Ez e o ( „) *; (/) - E ie j(„) h 
(s/)„, 1 < n < N, 1 < / < F. With NAIM, the flow 
conservation law across the whole network can be compactly 
written as At^ = s/, 1 < / < F. We use matrix T = 
[ tW t< 2 ) ... t< F ) ] e M. LxF to denote the collection 
of all flow vectors. With T and S, the flow conservation law 
can be further compactly written as AT = S. 

B. Channel Capacity of a MIMO Link 

In this section, we first briefly introduce some background 
of MIMO. We use a matrix Hf e C" rX,lt to represent 
the MIMO channel gain matrix from the transmitting node 
to the receiving node of link I, where n t and n r are the 
numbers of transmitting and receiving antenna elements of 
each node, respectively. H ; captures the effect of the scattering 
environment between the transmitter and the receiver of link 
I. In an additive white Gaussian noise (AWGN) channel, the 
received complex base-band signal vector for a MIMO link 
I with n t transmitting antennas and n r receiving antennas is 
given by 

yi = ^/plHixi + nj. (6) 



,(/) 



pNxF 



S = [ si s 2 ... sp 
of all source-sink vectors, we further have 

Se/ = src (/),dst(/) 0, 1 < / < F, (3) 
(l,S e/ )=0, l<f<F, (4) 
(S e/ ) src(/) = s f , l<f<F, (5) 

where e* is the f th unit column vector. 

( f) 

On link I, we let t\ > be the amount of flow of session 
/ in link /. We define e M L as the flow vector for 



where y ; and x ; represent the received and transmitted signal 
vector; n; is the normalized additive white Gaussian noise 
vector; p\ captures the path-loss effect, which is usually 
modeled as pi = G ■ Dj~ a , where G is some system specific 
constant, Di denotes the distance between the transmitting 
node and the receiving node of link I, and a denotes the path 
loss exponent. Let matrix Q; represent the covariance matrix 
of a zero-mean Gaussian input symbol vector x; at link I, 
i.e., Q; = e|x;-x||. This implies that Q; is Hermitian 
and Q; y 0. Physically, Q; represents the power allocation 
in different antenna elements in link I's transmitter and the 
correlation between each pair of the transmit and receive 
antenna elements. Tr{Q;} is the total transmission power at 
the transmitter of link I. The capacity of a MIMO link I in an 
AWGN channel with a unit bandwidth can be computed as 



RiiQi) = iog 2 



I + pjHiQjH 



(7) 



It can be seen that different power allocations to the antennas 
will have different impacts on the link capacity. Therefore, the 
optimal input covariance matrix Q)* needs to be determined. 
In a single link environment, the optimal input covariance 
matrix can be computed by water-filling the total power over 
the eigenmodes (signaling direction) of the MIMO channel 
matrix [1]. However, in a networking environment, finding 
the optimal input covariance matrices is a substantially more 
challenging task. Determining the optimal input covariance 
matrices is one of the major goals in our cross-layer opti- 
mization. 

C. MIMO-BC Link Layer 

A communication system where a single transmitter sends 
independent information to multiple uncoordinated receivers is 



referred to as a broadcast channel. If the channel gain of each 
link in the broadcast channel is a matrix and the noise to each 
link is a Gaussian random vector, the channel is termed "Gaus- 
sian vector broadcast channel". Fig. 2 illustrates a if -user 
Gaussian vector broadcast channel, where independent mes- 
sages W\ , . . . , Wk are jointly encoded by the transmitter, and 
the receivers are trying to decode W\, . . . , Wk, respectively. 
A (n, 2 nRl , . . . , 2 uRk )bc codebook for a broadcast channel 
consists of an encoding function x n (Wi, . . . , Wk ) where 
Wi e {1, . . . , 2 nR >}, i = 1, 2, . . . , K. The decoding function 
of receiver i is M^(y") . An error occurs when Wi ^ W t . 
A rate vector R = . . . , Rk] T is said to be achievable if 
there exists a sequence of (n, 2 nRl , . . . , 2 ,iRk )bc codebooks 
for which the average probability of error P e — > as the code 
length n — > oo. The capacity region of a broadcast channel is 
defined as the union of all achievable rate vectors [3]. Gaussian 
vector broadcast channel can be used to model many different 
types of systems [3]. Due to the close relationship between 
Gaussian vector broadcast channel and MIMO, we will call 
the Gaussian vector broadcast channel in the MIMO case as 
MIMO-BC throughout the rest of this paper. 
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Fig. 2. A Gaussian vector broadcast channel. 

For clarity, we use I\ to specifically denote the input 
covariance matrix of link i in a MIMO-BC, and Qj to denote 
an input covariance matrix in other types of MIMO channels. 
From the encoding process of DPC, the achievable rate in DPC 
scheme can be computed as follows: 



R-kH) = log 









I + Hx(i) 




H t (i ) 



(8) 



where 7r denotes a permutation of the set {1, . . . , K}, 
represents the i th position in permutation ir. One important 
observation of the dirty paper rate equation in (8) is that the 
rate equation is neither a concave nor a convex function of the 
input covariance matrices Ti, i = 1,2, ... ,K. 

Let H = [Hi, . . . ,H^] T be the collection of K channel 
gain matrices in the MIMO-BC, and T = [T 1 . . . , T K ] be 
the collection of K input covariance matrices. We define the 
dirty paper region Cdpc{P, H) as the convex hull of the 
union of all such rates vectors over all positive semidefinite 
covariance matrices T\, . . . , Tk satisfying Ti{^2d—i Ti} < P 
(the maximum transmit power constraint at the transmitter) 



and over all Kl permutations: 

C DPC (P,H) 4 Cov (u 7r; rR BC ( 7 r,r)) , 
where Cov(-) represents the convex hull operation. 

D. Problem Formulation 

In this paper, we aim to solve the problem of jointly 
optimizing DPC per-antenna power allocation at each node 
in the link layer and multihop/multipath routing in a MIMO- 
based WMN. Suppose that each node in the network has 
been assigned a certain (possibly reused) frequency band that 
will not cause interference to any other node in the network. 
Also, the incoming and outgoing bands of each node are 
non-overlapping such that each node can transmit and receive 
simultaneously. How to perform channel assignments is a huge 
research topic on its own merits, and there are a vast amount 
of literature that discuss channel assignment problems. Thus, 
in this paper, we focus on how to jointly optimize routing 
in the network layer and the DPC power allocation in the 
link layer for each node when a channel assignment is given. 
We adopt the well-known proportional fairness utility function, 
i.e., ln(s/) for flow /. In CRPA, we wish to maximize the sum 
of all utility functions. In the link layer, since the total transmit 
power of each node is subject to a maximum power constraint, 
we have £ ;e0(n) Tr{r z } < P^L 1 < n < N, where 
represents the maximum transmit power of node n. Since the 
total amount of flow in each link I cannot exceed its capacity 
limit, we must have ^/=i t\ S) < Ri(T), 1 < I < L. This can 
be further compactly written using matrix-vector notations as 
(l,T T e ; ) < Ri(T), 1 < I < L. Coupling the network layer 
model in Section II-A and MIMO-BC link layer model in 
Section II-C, we have the problem formulation for CRPA as 
in (9). 

CRPA: 



Maximize X^/=i m ( s /) 
subject to AT = S 
T > 

Se / =src(/),dst(/) 

(l,Se/)=0 
(Se/) src(/) = Sf 
(l,T T e t ) <Ri(T) 



V/ 
V/ 
V/ 
VI 



(9) 



RdT) e (Sc^.HW) VI EO(n) 



"DPC 

Ti h 

Variables: S, T, T 



(n) 
max 



Vn 
VI 



III. Reformulation of CRPA 

As we pointed out earlier, the DPC rate equation in (8) is 
neither a concave nor a convex function of the input covariance 
matrices. As a result, the cross-layer optimization problem in 
(9) is a non-convex optimization problem, which is very hard 
to solve numerically, let alone analytically. However, in the 
following, we will show that (9) can be reformulated as an 
equivalent convex optimization problem by projecting all the 
MIMO-BC channels onto their dual MIMO multiple-access 



channels (MIMO-MAC). We first provide some background 
of Gaussian vector multiple access channels and the channel 
duality between MIMO-BC and MIMO-MAC. 

A. MIMO-MAC Channel Model 

A communication system where multiple uncoordinated 
transmitters send independent information to a single receiver 
is referred to as a multiple access channel. If the channel 
gain of each link in the multiple access channel is a matrix 
and the noise is a Gaussian random vector, the channel is 
termed "Gaussian vector multiple access channel". Fig. 3 
illustrates a if -user Gaussian vector multiple access chan- 
nel, where independent messages W\, . . . , Wk, are encoded 
by transmitters 1 to K, respectively, and the receiver is 
trying to decode W U ...,W K . A (n, 2 nfll , . . . , 2 nRK ) MAC 
codebook for a multiple access channel consists of encoding 
functions x?(Wi),. . . ,x K (W K ) where W t E {1, . . .,2 nRi }, 
i = 1, 2, . . . , K. The decoding functions at the receiver are 
Wi(y™), i = 1, 2, . . . , K. An error occurs when Wi ^ Wi 
A rate vector R = \R\ , . . . , Rk 
there exists a sequence of (n, 2 nR 
for which the average probability of error P e — > as the 
code length n — > oo. The capacity region of a multiple 
access channel, denoted by Cmac is defined as the union of 
all achievable rate vectors [3]. We call the Gaussian vector 
multiple access channel in the MIMO case as MIMO-MAC 
throughout the rest of this paper. 



T is said to be achievable if 
2"^)mac codebooks 
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W K e 2" 
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Fig. 3. A Gaussian vector multiple access channel. 



B. Duality between MIMO-BC and MIMO-MAC 

The dual MIMO-MAC of a MIMO-BC can be constructed 
by changing the receivers in the MIMO-BC into transmitters 
and changing the transmitter in the MIMO-BC into the re- 
ceiver. The channel gain matrices in dual MIMO-MAC are the 
conjugate transpose of the channel gain matrices in MIMO- 
BC. The maximum sum power in the dual MIMO-MAC is the 
same maximum power level as in MIMO-BC. The relationship 
between a MIMO-BC and its dual MIMO-MAC is illustrated 
in Fig. 4. Similar to MIMO-BC, We denote the capacity region 
of the dual MIMO-MAC as C M Ac(-P,Ht). The following 
Lemma states the relationship between the capacity regions 
of a MIMO-BC and its dual MIMO-MAC. 
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Fig. 4. The relationship between MIMO-BC and its dual MIMO-MAC 



Lemma 1: The DPC region of a MIMO-BC channel with 
maximum power constraint P is equal to the capacity region 
of the dual MIMO MAC with sum power constraint P 

C DPC (P,H) =C M Ac(-P,Ht). 
Proof: The proof of this theorem can be arrived in various 
ways [8]— [10]. The most straightforward approach is to show 
that any MIMO-BC achievable rate vector is also achievable 
in its dual MIMO-MAC and vice versa. The MAC-to-BC and 
BC-to-MAC mappings can be found in [8]. It is also shown 
in [8] that any rate vector in a MIMO-BC with a particular 
encoding order can be achieved in its dual MIMO-MAC with 
the reversed successive decoding order. ■ 

C. Convexity of MIMO-MAC Capacity Region 

From Lemma 1, we know that the capacity region of a 
MIMO-BC and its dual MIMO-MAC is exactly the same. 
Therefore, we can replace Cdpc(') m (9) by the capacity 
regions of the dual MIMO-MAC channels Cmac(')- Tne 
benefits of such replacements is due to the following theorem. 

Theorem 1: The capacity region of a K-usei MIMO-MAC 
channel with a sum power constraint Y^f=i Tr(Qi) < P ma x 
is convex with respect to the input covariance matrices 

Qi, ...,Qk. 

Proof: Denote the input signals of the K users by 
xi, . . . , xjf, respectively, and denote the output of the MIMO- 
MAC channel by y. Since pi is a scalar, we absorb pi into H; 
in this proof for notation convenience. Theorem 14.3.5 in [3] 
states that the capacity region of a MIMO-MAC is determined 

by 



Cmac(Qi) ■ • ■ ! Qk) 



Cov < 



(Ri, . . . , Rk) 



VSC {!,..., K} 



max ) 



) , do) 



where the mutual information expression 7(; ) can be bounded 
as follows: 



I(xi,i e 5;y|xj,z e S c ) < log 



i + ^h1q 4 h, 

ies 



(11) 



To show that the capacity region of the MIMO-MAC with and 
a sum power constraint is convex, it is equivalent to show 
that the convex hull operation in (10) is unnecessary. To show 
this, consider the convex combination of two arbitrarily cho- 
sen achievable rate vectors [Ri, . . . , Rk] and [Ri, . . . , Rk] 
determined by two feasible power vectors [Qi, . . . , Q^-] and 
[Qi, . . . , Qk], respectively, i.e., we 

-Pmax an d 2~2iLi Tr(Qi) < P m ax- Let < a < 1 and consider 
the convex combination 

[R u ...,R K ] = a[R u ...,R K } + {l-a)[R 1 ,...,R K }. 



Also, let Qi = aQi + (1 — a)Qi, i = 1, . . . , K. It is easy to 
verify that Eti Tr(Q 4 ) < Pmaxi i- e -> the convex combination 
of two feasible power vectors is also feasible. Now, consider 



a ^ Ri + (1 — a) Ri < a log 

ies 



ies 



I + ^HtQ,H, 

ies 



+(l-a) log 



Since the function log |A| is a concave function for any 
positive semidefinite matrix variable A [3], it follows from 
Jensen's inequality that 



a^i?. t + (l-a)^4< ^log 

ies ies 



i + ^hJq.h, 

ies 



which means that the convex combination of rate vectors 
[Ri, . . . , Rk] and [R\, . . . , Rk] can also be achieved by using 
the feasible power vector [Qi, . . . , Qk] directly. As a result, 
the convex hull operation is unnecessary. ■ 

D. Maximum Weighted Sum Rate Problem of the Dual MIMO- 
MAC 

Now, we consider the maximum weighted sum rate problem 
of the dual MIMO-MAC. We simplify this problem such 
that we do not have to enumerate all possible successive 
decoding order in the dual MIMO-MAC, thus paving the way 
to efficiently solve the link layer subproblem we discuss in 
Section IV. 

Theorem 2: Associate each rate Ri in MIMO-MAC a non- 
negative weight Ui, i = 1,...,K, the maximum weighted 
sum max^- =1 Ri(Q) can be solved by the following convex 
optimization problem: 



Maximize J2?=i( u iv(i) - *M»-i))x 



log 



s-^K 



H 



subject to E^i^Qi) < 

Qi h0, i = l,...,K, 



(12) 



where tt T ™ = 0, ir(i),i = 1,...,K is a permutation on 
{1,...,K} such that u v m < ••• < u v /k)- l n particular, 
suppose that (Q*^), • • • , Q^k)) sorves (12), then the optimal 
rates of (12) are given by 



R 



log 



Ptt(K)^ 



Qt(K) H K 



(13) 



log 



1 + EjLi+i ^o) H t(j)Q^o) H J 



(14) 



for i = 1,2, .. .,K- 1. 



Proof: For convenience, we let 



$(S) = log 



I + £ H t(i)Qir(i) H ir 



(0 



ies 



Since tt{i) is simply a permutation on {1, ... , K}, from (10) 
and (11) we have the maximum weighted sum rate problem 
can be written as 

Maximize Y.f=i 

subject to J2 t es R *d) < V5 C {1, . . . , Jf }. 

Also from (10) and (11), it is easy to derive that R w ^ < 
$({*"(*)}) = log I + p w (i)H^ (i) Q w(i) H ffW . Since u w{1) < 
■ ■ ■ < u tt(k)> from KKT condition, we must have that the 
constraint R v [k) = ®({k(K)}) must be tight at optimality. 
That is, 

Rtt(k) — log I + p 7r (K)H] r( . K) Q 7r ( if )H 7r ( if ) . (15) 
Again, from (10) and (11), we have 

R-k(K-I) + Rtt(K) < log I + PifjqH^^QTfjqHjfjf) 



So, 



+P7r(K-l)H 7r(K _ 1) Q 7I -(j ( -_i)H 7r ( A: -l) 



Rir(K~l) < log I + Pw(A-)H w ( Ji -)Q w (ji-)H 1I .^ A .^ 
+Ptt(K-1)W-k(K-1)Q-k(K-1)K\^ k _ 1) 



log 



I + /37r(x)H 7r( _ ft :)Q T ( A :)H^ (K) 



(16) 



Since u v / K _i\ is the second largest weight, again from KKT 
condition, we must have that (16) must be tight at optimality. 
This process continues for all K users. Subsequently, we have 
that 



R n{i) = log I + PttOoH^Q^H 

- log i + EjLi+i ^o) h L-)QttO)H 



, (17) 



for i = 1, . . . , K — 1. Summing up all u^^R^^) and after 
rearranging the terms, it is readily verifiable that 



K 



K 



log 



I + Xl p ' r O) H to-) Q ' r O) H ' r 



Cj) 



(18) 



It then follows that the maximum weighted sum rate problem 
of MIMO-MAC is equivalent to maximizing (18) subject to 
the sum power constraint, i.e., the optimization problem in 



(12). Since log | - J is a concave function of positive semidefinite 
matrices, (18) is a convex optimization problem with respect to 
Q,r(i) , • • • , Q-k(k)- After we obtain the optimal solution power 
solution (Q*/!), • • • , QJr(K))' me corresponding link rates can 
be computed by simply following (15) and (17). ■ 

E. Problem Reformulation 

We now reformulate CRPA by replacing Cdpc m (9) with 
Cmac> an d we denote the equivalent problem by CRPA-E. 
After solving CRPA-E, we can recover the corresponding 
MIMO-BC covariance matrices T* from the optimal solution 
Q* of CRPA-E by the MAC-to-BC mapping provided in [8]. 

CRPA-E: 



Maximize X)/=i m ( s /) 
subject to AT = S 
T > 

Se/ = S rc(/),dst(/) 

(i,s e/ ) = o 

(Se/) src(/) = s f 
(l,T T e ; ) <Ri(Q) 







(19) 



HQ) e 4"ic(^ n) 



V/ 
v.f 
v/ 

VI 

,Ht(™)) VI eO(n) 

YsleO(n) Tr {Qi} < -Pmal Vn 

Qz^O VI 
Variables: S, T, Q 

IV. Solution Procedure 

Since CRPA-E is a convex programming problem, we can 
solve CRPA-E exactly by solving its Lagrangian dual problem. 
Introducing Lagrangian multipliers Ui to the link capacity 
coupling constraints (1, T T e;) < i?;(Q), Hence, we can write 
the Lagrangian as 



e(u) 



sup {L(S,T,Q,u)|(S, T, Q) e *} , (20) 

S,T\Q 



where 



L(S, T, Q, u) = ^ In ( 8f ) + J> (MQ) - (1. T r e,» 

/ l 



and * is defined as 



(S,T,Q) 



AT = S 
T > 

Se 7 = src (/),dst(/) 
(l,Se/)=0 
(Se/) src{/) = s f 

E leOW »{Ql}<^al 

QihO 

Ri(Q) eCMAo^iSL.HtW) 



v/ 
v/ 
v/ 

Vn 

v; 

Vn 



The Lagrangian dual problem of CRPA can thus be written 

as: 

d crpa-e . Minimize e( u ) 
subject to u > 0. 

It is easy to recognize that, for a given u, the Lagrangian in 
(20) can be rearranged and separated into two terms: 

9(u) = 6 ne t(u) +6link(u), 



where, for a given Lagrangian multiplier u, 9 no t and 6n n k 
are corresponding to network layer and link layer variables, 
respectively: 

0ST A ~ E : ©net(u) = Maximize In (s f ) 

-E,«i<l,T T e I ) 
subject to AT = S 
T > 

Se / =src(/),dst(/) V/ 

(l,S e/ ) = Vf 
(Se/) src(/) = s f Vf 
Variables: S, T 



D 



CRPA-E 
link 



©link(u) = 

subject to 



Variables: 



Maximize^ mz-Rz(Q) 

SzeO(n) Tr{Q(} < Pma> 

Qi h 

Ri{Q) e C M Ac(-P4l- ) x,Ht(™)) 

VI eO(n),ne N 

Q 



Vn 
VI 



The CRPA-E Lagrangian dual problem can thus be written 
as the following master dual problem: 

md crpa-e . Mimmize e net ( u ) + e Unk (u) 

subject to u > 

Notice that link (u) can be further decomposed on a node- 
by-node basis as follows: 



Bii„k(u) = max^u ( i? ( (Q) 



N 



]T I max ]T «,J*(Q)I =£e<ft(u<»>) 



(21) 



It is seen that © 1 ( i " ) k (u(") 



) = max^, eOW uiRi(Q) is a 
maximum weighted sum rate problem of the dual MIMO- 
MAC for some given dual variables u'") as weights. Without 
loss of generality, suppose that node n has K outgoing links, 
which are indexed as 1, . . . , K and are associated with dual 
variables m, . . . , Uk, respectively. Let € {1, . . . , K} be 
the permutation such that < w^i) < ■ • ■ < u -k(k) an d define 



Mo) 



o. ew (u w 



) can be written as follows: 



Maximize X) i=1 (ti w (i) - *M»-i))>< 

log I + . p^Ul^Q^U^ 

subject to Tr(Qi) < 

QihO, i = l,...,K. 



(22) 



Note that in the network layer subproblem nct (u), the 
objective function is concave and all constraints are affine. 
Therefore, net (u) is readily solvable by many polynomial 
time convex programming methods. However, even though 
<9iink( u< ^) is also a convex problem, generic convex pro- 
gramming methods are not efficient because the structures of 
its objective function and constraints are very complex. In the 
following subsections, we will discuss in detail how to solve 

e£L(u<»>). 



A. Conjugate Gradient Projection for Solving ©[^(u^) 

We propose an efficient algorithm based on conjugate gradi- 
ent projection (CGP) to solve (22). CGP utilizes the important 
and powerful concept of Hessian conjugacy to deflect the 
gradient direction appropriately so as to achieve the superlinear 
convergence rate [11], which is similar to that of the well- 
known quasi-Newton methods (e.g., BFGS method). In each 
iteration, CGP projects the conjugate gradient direction to find 
an improving feasible direction. The framework of CGP for 
solving (22) is shown in Algorithm 1. 



Algorithm 1 Gradient Projection Method 

Initialization: 

Choose the initial conditions Q(°) = [QY , Q2 , • 
k = 0. 
Main Loop: 



,Q 



(0),T 



. Let 



1. Calculate the conjugate gradients G 



(fc) 



2. Choose an appropriate step size s^. Let Q. 
for i = 1, 2, . . . , K. 

3. Let Q< fe ) be the projection of Q'CO onto fl + (P^). 

4. Choose an appropriate step size a k . Let Q ; v = Q ; 
a*(Qj fc) -Q\ k) ),i = l,2,...,K. 



(k) 



5. k = k + 1. If the maximum absolute value of the elements in Q . 

(k-Vi % 
< e, for i = 1, 2, . . . , L, then stop; else go to step 1. 



We adopt the "Armijo's Rule" inexact line search method 
to avoid excessive objective function evaluations, while still 
enjoying provable convergence [11]. For convenience, we 
use F(Q) to represent the objective function in (22), where 
Q = (Qi; • ■ • 7 Qif ) denotes the set of covariance matrices at 
a node. According to Armijo's Rule, in the k th iteration, we 
choose 0-/0 = 1 and a k = (i mk (the same as in [12]), where 
m k is the first non-negative integer m that satisfies 

F(Q (fc+1) ) - F(Q W ) > cr/? m (G (fc) , Q (fe) - Q w ) 



K 

= a(3 m Tr [g 



t(fe) 



Qf ) - Qf r 



(23) 



where < < 1 and < a < 1 are fixed scalars. 

B. Computing the Conjugate Gradients 

The gradient G v {j) = VQ x(j) F(Q) depends on the partial 
derivatives of F(Q) with respect to Q ff (j). By using the 
formula ai"|A+BXC| = [C(A + BXC)-^] T [12], [13], 
we can compute the partial derivative of the i th term in the 
summation of F(Q) with respect to Q w y), j > i, as follows: 



i) 



(«7T(i) 



log 



i 7r(i-l 



A' 



))x 



X^( fe ) H l(fc)Q?rM H ^« 



U 7T(i-l) 



-1 T 



H 



;fc) H t(fc)QT(fc) H "(fe) 



Ht , 



k—i 



To compute the gradient of F(Q) with respect to Q^y), we 
notice that only the first j terms in F(Q) involve Q^m. From 
the definition V z f(z) = 2(df(z)/dz)* [14], we have 



G 7T(j) - 2p w (j)H ff y) 



A' 



(fc) 



Remark 1: It is important to point out that we can exploit 
the special structure in (24) to significantly reduce the com- 
putation complexity in the implementation of the algorithm. 
Note that the most difficult part in computing G n (j) is the 
summation of the terms in the form of H^ fe ^Q 7r ( fe )H 7r ( fe ). 
Without careful consideration, one may end up computing 
such additions j(2K + 1 — j)/2 times for G n (j). However, 
notice that when j varies, most of the terms in the summation 
are still the same. Thus, we can maintain a running sum for 
1 + EfcLi P7r(fc)H| r(fe) Q 7r(fc) H 7r(fe) , start out from j = K, and 
reduce j by one sequentially. As a result, only one new term 
is added to the running sum in each iteration, which means 
we only need to do the addition once in each iteration. 

The conjugate gradient direction in the m th iteration can 
be computed as G^?\ — G^i + K m G| r ™^ 1 ' ) . We adopt the 
Fletcher and Reeves' choice of deflection [11], which can be 
computed as 

i|Q(m) - 

n m = r U) :: . (25) 



|G 



The purpose of deflecting the gradient using (25) is to find 
G^jy which is the Hessian-conjugate of G^?^ 1 \ By doing 
so, we can eliminate the "zigzagging" phenomenon encoun- 
tered in the conventional gradient projection method, and 
achieve the superlinear convergence rate [11] without actually 
storing a large Hessian approximation matrix as in quasi- 
Newton methods. 

C. Projection onto Cl+(PmaL) 

Noting from (24) that G T /ji is Hermitian, we have that 



q}£ - Q (fe) 



ShG^)., is Hermitian as well. Then, the 
projection problem becomes how to simultaneously project 
\0 (n) | Hermitian matrices onto the set 



(n) 



J2l Tr {Qi} < Pi 

QihO, leO (n) 



This problem belongs to the class of "matrix nearness prob- 
lems" [15], [16], which are not easy to solve in general. 
However, by exploiting the special structure in ©{"^(u), we 
are able to design a polynomial-time algorithm. 

We construct a block diagonal matrix D = 

EHag{Q ff (l)...Q,(A-)} G £(K-n r )x(K-n r )^ ft j s easy tQ 

recognize that Q^m e fi+(P4l«), j = l,...,K, only if 



Tr(D) = Ej=iTr(Q 7r0) ) < P£L and D h 0. We use 
Frobenius norm, denoted by || • as the matrix distance 



criterion. The distance between two matrices A and B is 
defined as ||A - B|| F = (Tr [(A - B)t(A - B)]) Thus, 
given a block diagonal matrix D, we wish to find a matrix 
D e fi+(Pmax) such that D minimizes ||D - B\\ F . For 
more convenient algebraic manipulations, we instead study 
the following equivalent optimization problem: 



Minimize 



HID 



subject to Tr(D) < P^ x , D^O. 



(26) 



In (26), the objective function is convex in D, the constraint 
D y represents the convex cone of positive semidefinite 
matrices, and the constraint Tr(D) < is a linear 

constraint. Thus, the problem is a convex minimization prob- 
lem and we can exactly solve this problem by solving its 
Lagrangian dual problem. Associating Hermitian matrix II to 
the constraint D >; and fi to the constraint Tr(D) < Pm™L 
we can write the Lagrangian as g(Tl, /i) = min I = ) {(l/2)||D — 
D||| - Tr(ntD) + /i(Tr(D) - P^l)}. Since g{U,n) is 
an unconstrained convex quadratic minimization problem, we 
can compute the minimizer of the Lagrangian by simply 
setting its first derivative (with respect to D) to zero, i.e., 
(p - D) - nt + (jl = 0. Noting that W = II, we have 
D = D — ^1 + II. Substituting D back into the Lagrangian, 
we have 

g (u, (j,) = -1||d - M i + n\\% - /xPil + \\\m 2 - 

Therefore, the Lagrangian dual problem can be written as 

Maximize -±||D - /xl + Tl\\ 2 F - fiP^ + ±||D|| 2 
subject to II y 0, ijl > 0. 

(27) 

After solving (27), we can have the optimal solution to (26) 
as 

D* = D fj,*I + II*, 

where ^* and II* are the optimal dual solutions to Lagrangian 
dual problem in (27). We now consider the term D — [il + 
II, which is the only term involving II in the dual objective 
function. From Moreau Decomposition [17], we immediately 
have 

fjl + U\\ F = (D- M I) + , 



min 1 1 D 
n 



where the operation (A) + means performing eigenvalue de- 
composition on matrix A, keeping the eigenvector matrix 
unchanged, setting all non-positive eigenvalues to zero, and 
then multiplying back. Thus, the matrix variable II in the 
Lagrangian dual problem can be removed and the Lagrangian 
dual problem can be rewritten as 



Maximize VG") — — ^ || (D — — /iP r 

subject to \i > 0. 



(n) 
max 



(28) 



Suppose that after performing eigenvalue decomposition on D, 
we have D = UAU^, where A is the diagonal matrix formed 
by the eigenvalues of D, U is the unitary matrix formed by 
the corresponding eigenvectors. Since U is unitary, we have 

(D- M I) + =U(A- M I) + Ut. 



It then follows that 

\\{n-M + \\ 2 F = \\(A-M + \\ 2 F . 

We denote the eigenvalues in A by A,, i = 1, 2, . . . , K ■ n r . 
Suppose that we sort them in non-increasing order such that 
A = Diag{Ai A 2 ... Ax.„ r }, where Ai > . . . > X Knr - It 
then follows that 

K-n T 

||(A-/d)X= (max^A,-^}) 2 . 
So, we can rewrite as 



1 



K-, 



1p(fl) =-^Yl ( maX {°> X 3 - ^}) 2 - M-P, 



(n) 
max' 



(29) 



It is evident from (29) that ip(n) is continuous and (piece-wise) 
concave in fi. Due to this special structure, we can search the 
optimal value of \i as follows. Let / index the pieces of V'(m). 
I = 0, 1, . . . , K ■ n r . Initially we set I = and increase / 
subsequently. Also, we introduce A = oo and \K-n r +i = 
— oo. We let the endpoint objective value tpj (Ao) = 0, </>* = 
ipf (Ao), and \i* = Ao- If / > K ■ n r , the search stops. For a 
particular index /, by setting 



(») 



= 0, 



we have 



(n) 
max 



Now we consider the following two cases: 

1) If fi*- € [A/ +1 ,Aj>] n R + , where K + denotes the set 
of non-negative real numbers, then fj,*- is the optimal 
solution because V(m) i s concave in \i. Thus, the point 
having zero-value first derivative, if exists, must be the 
unique global maximum solution. Hence, we can let 
M* = fj,*- and the search is done. 

2) If $ [A/ +1 , A/] n M+, we must have that the local 
maximum in the interval [Aj +1 , Aj] fll + is achieved at 
one of the two endpoints. Note that the objective value 
tpj (Aj) has been computed in the previous iteration 
because from the continuity of the objective function, 
we have V/ (A/) = ip}_ 1 (A/). Thus, we only need to 
compute the other endpoint objective value ipf (A^ +1 ). 
If ipj < ipf (Aj) = <p*, then we know ^* is the 
optimal solution; else let fi* = Xf +1 , 4>* = tpj (A/ +1 ), 
1 = 1 + 1 and continue. 

Since there are K ■ n r + 1 intervals in total, the search process 
takes at most K ■ n r + 1 steps to find the optimal solution fi* . 
Hence, this search is of polynomial-time complexity 0{n r K). 
After finding yu*, we can compute D* as 



D* = (D /i*I) + = U (A - fj,*T) + U f . 



(30) 



The projection of D onto fi+(p4ax) is summarized in Algo- 
rithm 2. 



Algorithm 2 Projection onto Q+(PmaL) 
Initiation: 

1. Construct a block diagonal matrix D. Perform eigenvalue decompo- 
sition D = UAUt, sort the eigenvalues in non-increasing order. 

2. Introduce Ao = oo and Xx-nt+l = Let 1 = 0. Let the 
endpoint objective value i/"f (^o) = 0, <f>* = 4>j (Ao), and /i* = Ao. 

Main Loop: 

1. If / > K ■ n r , go to the final step; else let = (5Zj=i ~~ P)/I- 

2. If fi*. £ [A| +1 , A|] nM+, then let /i* = /it and go to the final step. 

3. Compute i/>j(Aj +1 ). If ipj(\j +1 ) < (f>*, then go to the final step; 
else let /i* = A^ +1 , <f>* = tpj (Aj +1 ), 1 = 1 + 1 and continue. 

Final Step: Compute D as D = U (A - IJt. 



D. Solving the Master Dual Problem 

1) Cutting-Plane Method for Solving 0(u): The attractive 
feature of the cutting-plane method is its robustness, speed of 
convergence, and its simplicity in recovering primal feasible 
optimal solutions. The primal optimal feasible solution can be 
exactly computed by averaging all the primal solutions (may or 
may not be primal feasible) using the dual variables as weights 
[11]. Letting z = 6(u), the dual problem is equivalent to 

Minimize z 

subject to z > E/ In (s/) + Ei «J (Ri(Q) - (1, T T e ; >) 
u > 0, 

(31) 

where (S, T, Q) e 'J. Although (31) is a linear program with 
infinite constraints not known explicitly, we can consider the 
following approximating problem: 

Minimize z 

subject to z > £/ ln(sf) + £ ; u t (i?;(Q«)- n „ 

(It^K)) (32) 

u > 0, 

where the points (SW,TW,QW) e j = l,...,k-l. 
The problem in (32) is a linear program with a finite number 
of constraints and can be solved efficiently. Let (z^ k \ u^) 
be an optimal solution to the approximating problem, which 
we refer to as the master program. If the solution is feasible 
to (31), then it is an optimal solution to the Lagrangian dual 
problem. To check the feasibility, we consider the following 
subproblem: 

Maximize In (s f ) + u\ k) (i?,(Q) - (1, T T e ( )) 
subject to (S, T, Q) e * 

(33) 

Suppose that (S( fe ), T( fe) , Q( fc) ) is an optimal solution to the 
subproblem (33) and 0*(u( fe )) is the corresponding optimal 
objective value. If Zk > Q*(u^), then u' fc ^ is an optimal 
solution to the Lagrangian dual problem. Otherwise, for u = 
u( k \ the inequality constraint in (31) is not satisfied for 
(SO), T®, QW). Thus, we can add the constraint 

z > Eln(4 fe) ) +E U ' - <l.T< fc > r e«>) (34) 

/ i 



to (32), and re-solve the master linear program. Obviously, 
( 2 W, u ( fe )) violates (34) and will be cut off by (34). The 
cutting plane algorithm is summarized in Algorithm 3. 



Algorithm 3 Cutting Plane Algorithm for Solving D CRPA 

Initialization: 

Find a point (S<°',T( ',Q( 0) ) 6 Let k = 1. 
Main Loop: 

1. Solve the master program in (32). Let (zW , u( fc )) be an optimal 
solution. 

2. Solve the subproblem in (33). Let (S^' , T< fe ) , Q<») be an optimal 
point, and let 0*(u( fe )) be the corresponding optimal objective value. 

3. If z'k' > 0(u(' c '), then stop with u'*' as the optimal dual solution. 
Otherwise, add the constraint (34) to the master program, replace k 
by k + 1, and go to step 1. 



2) Subgradient Algorithm for Solving O(u): Since the 
Lagrangian dual objective function is piece-wise differentiable, 
subgradient method can also be applied. For O(u), starting 
with an initial and after evaluating subproblems 6 ne t(u) 
and 6ii n k for u( fe ) in the k th iteration, we update the dual 
variables by u^ fe+1 ' = [u fc — A^d^] , where the operator 
[•]+ projects a vector on to the nonnegative orthant, and 
denotes a positive scalar step size. d (»0 is a subgradient of 
the Lagrangian at point u( fc ). It is proved in [11] that the 
subgradient algorithm converges if the step size Afe satisfies 
Afe — > as k — > oo and 2~2"kLo ^ k ~ 00 ■ ^ simple and useful 
step size selection strategy is the divergent harmonic series 
Y^kLiPj = 00 > wnere /3 is a constant. The subgradient for 
the Lagrangian dual problem can be computed as 

i? ; (Q*(u))-(l,T*(u) T e ; ), / = l, 2 ,...,L. (35) 

Specifically, the subgradient method has the following proper- 
ties which make it possible to be implemented in a distributed 
fashion: 

1) Subgradient computation only requires local traffic in- 
formation (l,T T e/) and the available link capacity 
information Ri(Q) at each link L As a result, it can 
be computed locally. 

2) The choice of step size \ k — f3^ depends only upon the 
iteration index k, and does not require any other global 
knowledge. In conjunction with the first property, the 
dual variable, in the iterative form of u\ k+1 ^ = uf^ + 
Afe (90 (u) /dui), can also be computed locally. 

3) The objective functions 8ii n k can be decomposed on a 
node-by-node basis such that each node in the network 
can perform the computation in parallel. Likewise, the 
network layer subproblem 6 nc t can be decomposed on 
a source-by-source basis such that each source node can 
perform the routing computation locally after receiving 
the dual variable information of each link in the network. 

It is worth to point out that care must be taken when recov- 
ering the primal feasible optimal solution in the subgradient 
method. Generally, the primal variables in the dual optimal 
solution are not primal feasible unless the dual optimal solu- 
tion happens to be the saddle point. Fortunately, since CRPA-E 
is convex, its primal feasible optimal solution can be exactly 
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Fig. 5. A 15-node network example. 
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Fig. 6. Convergence behavior of the cutting-plane method 



computed by solving a linear programming problem (see [11] 
for further details). However, such a recovery approach cannot 
be implemented in a distributed fashion. In this paper, we 
adopt a variant of Shor's rule to recovery primal optimal 
feasible solution. Due to space limitation, we refer readers 
to [18] for more details. 

V. Numerical Results 

In this section, we present some numerical results through 
simulations to provide further insights on solving CRPA. 
N randomly-generated MIMO-enabled nodes are uniformly 
distributed in a square region. Each node in the network is 
equipped with two antennas. The maximum transmit power for 
each node is set to P max = 10dBm. Each node in the network 
is assigned a unit bandwidth. We illustrate a 15-node network 
example, as shown in Fig. 5, to show the convergence process 
of the cutting-plane and the subgradient methods for solving 
j-jCRPA-e j n exam pi e ^ there are three flows transmitting 
across the network: N14 to Nl, N6 to N10, and N5 to N4, 
respectively. 

A. Cutting-Plane Method 

For the 15-node example in Fig. 5, the convergence process 
for the cutting-plane method is illustrated in Fig. 6. The 
optimal objective value for this 15-node example is 6.72. The 
optimal flows for sessions N14 to Nl, N6 to N10, and N5 to 
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Fig. 7. Convergence behavior of the subgradient method 



N4 are 9.17 bps/Hz, 9.30 bps/Hz, and 9.93 bps/Hz, respec- 
tively. It can be observed that the cutting-plane algorithm is 
very efficient: It converges with approximately 160 cuts. As 
expected, the duality gap is zero because the convexity of the 
transformed equivalent problem based on dual MIMO-MAC. 

B. Subgradient Method 

For the 15-node example in Fig. 5, the convergence process 
for the subgradient method is illustrated in Fig. 7. The step 
size selection is Afe = 0.1/A:. The subgradient method also 
achieves the same optimal solution and objective value when 
it converges. However, it is seen that the subgradient algorithm 
takes approximately 1600 iterations to converge, which is 
much slower than the cutting-plane method. This is partially 
due to the heuristic nature in step size selection (cannot be too 
large or too small at each step). It is also partially due to the 
cumbersomeness in recovering the primal feasible solution in 
the subgradient method. In this example, the dual upper bound 
takes approximately 1050 iterations to reach near the optimal. 
However, the near-optimal primal feasible solution cannot be 
identified until after 1500 iterations. 

C. Comparison between BC and TDM 

We now study how much performance gain we can get by 
using Gaussian vector broadcast channel technique as opposed 
to the conventional time-division (TDM) scheme. The cross- 
layer optimization problem of MIMO-based mesh networks 
over TDM scheme is also a convex problem. Thus, the 
basic Lagrangian dual decomposition framework and gradient 
projection technique for the link layer subproblem are still 
applicable. The only difference is in the gradient computation, 
which is simpler in TDM case. For the same 15-node network 
with TDM, we plot the convergence process of the cutting- 
plane algorithm in Fig. 8. In TDM case, the optimal objective 
value is 5.01. For this example, we have 34.4% improvement 
by using DPC. 

VI. Related Work 

Despite significant research progress in using MIMO for 
single-user communications, research on multi-user multi-hop 
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Fig. 8. The convergence behavior in TDM case 



MIMO networks is still in its inception stage. There are many 
open problems, and many areas are still poorly understood 
[19]. Currently, the relatively well-studied research area of 
multi-user MIMO systems are cellular systems, which are 
single-hop and infrastructure-based. For multi-hop MIMO- 
based mesh networks, research results remain limited. In [20], 
Hu and Zhang studied the problem of joint medium access 
control and routing, with a consideration of optimal hop 
distance to minimize end-to-end delay. In [21], Sundaresan and 
Sivakumar used simulations to study various characteristics 
and tradeoffs (multiplexing gain vs. diversity gain) of MIMO 
links that can be leveraged by routing layer protocols in rich 
multipath environments to improve performance. In [22], Lee 
et al. proposed a distributed algorithm for MIMO-based multi- 
hop ad hoc networks, in which diversity and multiplexing 
gains of each link are controlled to achieve the optimal rate- 
reliability tradeoff. The optimization problem assumes fixed 
SINRs and fixed routes between source and destination nodes. 
However, in these works, there is no explicit consideration of 
per-antenna power allocation and their impact on upper layers. 
Moreover, DPC in cross-layer design has never been studied 
either. 

VII. Conclusions 

In this paper, we investigated the cross-layer optimization 
of DPC per-antenna power allocation and multi-hop multi- 
path routing for MIMO-based wireless mesh networks. Our 
contributions are three-fold. First, this paper is the first work 
that studies the impacts of applying dirty paper coding, which 
is the optimal transmission scheme for MIMO broadcast 
channels (MIMO-BC), to the cross-layer design for MIMO- 
based wireless mesh networks. We showed that the network 
performance has dramatic improvements compared to that 
of the conventional time-division/frequency division schemes. 
Second, we solved the challenging non-connvex cross-layer 
optimization problem by exploiting the channel duality be- 
tween MIMO-MAC and MIMO-BC, and we showed that 
transformed problem under dual MIMO-MAC is convex. We 
simplified the maximum weighted sum rate problem, thus 
paving the way for solving the link layer subproblem in 



the Lagrangian dual decomposition. Last, for the transformed 
problem, we develop an efficient solution procedure that 
integrates Lagrangian dual decomposition, conjugate gradient 
projection based on matrix differential calculus, cutting-plane, 
and subgradient methods. Our results substantiate the impor- 
tance of cross-layer optimization for MIMO-based wireless 
mesh networks with Gaussian vector broadcast channels. 
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